Problems with Using the Normal Distribution – and Ways to Improve Quality and Efficiency of Data Analysis

نویسندگان

  • Eckhard Limpert
  • Werner A. Stahel
چکیده

BACKGROUND The gaussian or normal distribution is the most established model to characterize quantitative variation of original data. Accordingly, data are summarized using the arithmetic mean and the standard deviation, by mean ± SD, or with the standard error of the mean, mean ± SEM. This, together with corresponding bars in graphical displays has become the standard to characterize variation. METHODOLOGY/PRINCIPAL FINDINGS Here we question the adequacy of this characterization, and of the model. The published literature provides numerous examples for which such descriptions appear inappropriate because, based on the "95% range check", their distributions are obviously skewed. In these cases, the symmetric characterization is a poor description and may trigger wrong conclusions. To solve the problem, it is enlightening to regard causes of variation. Multiplicative causes are by far more important than additive ones, in general, and benefit from a multiplicative (or log-) normal approach. Fortunately, quite similar to the normal, the log-normal distribution can now be handled easily and characterized at the level of the original data with the help of both, a new sign, x/, times-divide, and notation. Analogous to mean ± SD, it connects the multiplicative (or geometric) mean mean * and the multiplicative standard deviation s* in the form mean * x/s*, that is advantageous and recommended. CONCLUSIONS/SIGNIFICANCE The corresponding shift from the symmetric to the asymmetric view will substantially increase both, recognition of data distributions, and interpretation quality. It will allow for savings in sample size that can be considerable. Moreover, this is in line with ethical responsibility. Adequate models will improve concepts and theories, and provide deeper insight into science and life.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparing the Efficiency of Dmus with Normal and Skew-Normal Distribution using Data Envelopment Analysis

  Data envelopment analysis (DEA) is a nonparametric approach to evaluate theefficiency of decision making units (DMU) using mathematical programmingtechniques. Almost, all of the previous researches in stochastic DEA have been usedthe stochastic data when the inputs and outputs are normally distributed. But, thisassumption may not be true in practice. Therefore, using a normal distribution wi...

متن کامل

Efficiency Evaluation by using mixed modeling of Data Envelopment Analysis and Balanced Scorecard- A Case Study in the banking industry

The first objective in any financial organization is to improve performance, and performance evaluation also is one of the best ways to advance operations in organizations. By utilizing different methods of performance evaluation, organizations can evaluate the effectiveness and efficiency of processes that are in accord with strategic objectives. In addition, the performance evaluation instrum...

متن کامل

A New Dynamic Random Fuzzy DEA Model to Predict Performance of Decision Making Units

Data envelopment analysis (DEA) is a methodology for measuring the relative efficiency of decision making units (DMUs) which ‎consume the same types of inputs and producing the same types of outputs. Believing that future planning and predicting the ‎efficiency are very important for DMUs, this paper first presents a new dynamic random fuzzy DEA model (DRF-DEA) with ‎common weights (using...

متن کامل

Designing a new multi-objective fuzzy stochastic DEA model in a dynamic ‎environment to estimate efficiency of decision making units (Case Study: An Iranian Petroleum Company)

This ‎paper presents a new multi-objective fuzzy stochastic data envelopment analysis model          (MOFS-DEA) under mean chance constraints and common weights to estimate the efficiency of decision making units for future financial periods of them. In the initial MOFS-DEA ‏model, the outputs and inputs are ‎characterized by random triangular fuzzy variables with normal distribution, in which ...

متن کامل

Stochastic DEA with Using of Skew-Normal Distribution in Error Structure

The stochastic data envelopment analysis (SDEA) was developed considering the value ofinputs and outputs as random variables. Therefore, statistical distributions play an importantrole in this regard. The skew-normal (SN) distribution is a family of probability densityfunctions that is frequently used in practical situations. In this paper, we assume that the inputand output variables are skew-...

متن کامل

Application of quality function deployment (QFD) to improve product design: The school furniture case

Today Quality Function Deployment (QFD) is a powerful development method whit a wide range of applications to translate customers’ needs into technical requirements for achieving customer satisfaction. The current study demonstrated a QFD analysis to improve school furniture design in Tehran as the baseline of Iran. Accordingly, we extended the widely used QFD method into a complex set of custo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 6  شماره 

صفحات  -

تاریخ انتشار 2011